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Abstract 

Fiducial cross-sections for tt production with one or two additional Z?-jets are reported, 
using an integrated luminosity of 20.3 fb”* of proton-proton collisions at a centre-of- 
mass energy of 8 TeV at the Large Hadron Collider, collected with the ATLAS detector. 
The cross-section times branching ratio for tt events with at least one additional b-]e.t 
is measured to be 950 + 70 (stat.) (syst.) fb in the lepton-plus-jets channel and 50 
+ 10 (stat.) ;|;[q (syst.) fb in the e/u channel. The cross-section times branching ratio 
for events with at least two additional Z>-jets is measured to be 19.3 + 3.5 (stat.) + 
5.7 (syst.) fb in the dilepton channel (e/i,///!, and ee) using a method based on tight 
selection criteria, and 13.5 + 3.3 (stat.) + 3.6 (syst.) fb using a looser selection that 
allows the background normalisation to be extracted from data. The latter method also 
measures a value of 1.30 + 0.33 (stat.) + 0.28 (syst.)% for the ratio of ft production with 
two additional (7-jets to tf production with any two additional jets. All measurements are 
in good agreement with recent theory predictions. 
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1 Introduction 


The measurement of top quark pair {tt) production in association with one or more jets containing 
Z>-hadrons (henceforth referred to as ( 7 -jets) is important in providing a detailed understanding of 
quantum chromodynamics (QCD). The most accurate theoretical predictions for these processes are 
fixed-order calculations at next-to-leading order (NLO) accuracy [1-3] in perturbative QCD (pQCD), 
which have been matched to a parton shower [4-6]. These calculations have significant uncertainties 
from missing higher-order terms [7, 8], making direct experimental measurements of this process de¬ 
sirable. The measurement of such cross-sections in fiducial phase-spaces, defined to correspond as 
closely as possible to the acceptance of the ATLAS detector, can be compared to theoretical predic¬ 
tions using the same fiducial requirements. This minimises theoretical extrapolations to phase-space 
regions that are not experimentally measurable. 

Moreover, following the discovery of the Higgs boson [9, 10], the Standard Model prediction for the 
top quark Yukawa coupling can be tested via a measurement of the ttH associated production cross- 
section. Due to the large Higgs branching ratio to ( 7 -quarks, the tJH ttbb channel is promising, but 
suffers from a large and poorly constrained background of events with top pairs and additional ( 7 -jets 
from QCD processes [11-13]. 

Measurements of tt production with additional heavy-flavour jets have been performed by ATLAS 
at = 7 TeV [14] and CMS at = 8 TeV [15, 16]. The ATLAS measurement reported a ratio 
of heavy flavour to all jets produced in association with a tt pair where heavy flavour includes both 
bottom jets as well as charm jets. The CMS measurement is a fiducial measurement of events with 
two leptons and four or more jets, of which at least two are identified as containing a ( 7 -hadron. 

This paper presents measurements of fiducial cross-sections for tt production in association with one 
or two additional (7-jets. Because the top quark decays almost exclusively to a (7-quark and a W boson, 
these processes have three or four (7-jets in the final state. The particle-level objects are required to 
be within the detector acceptance of \r]\ < 2.5, where 77 is the pseudorapidity. ^ The jets are required 
to have transverse momenta above 20 GeV and the electrons and muons to have transverse momenta 
above 25 GeV. The lepton-plus-jets and dilepton {eji) channels^ are used to perform two measurements 
of the cross-section for the production of tt events with at least one additional (7-jet. In both cases, 
the signal cross-section is extracted from a fit to a multivariate discriminant used to identify (7-tagged 
jets [17]. The lepton-plus-jets channel has a higher acceptance times branching ratio, but suffers from 
a significant background of events in which the W boson decays to a c- and a light quark. 

Two analysis techniques are used in the dilepton channel {ee, yu/i and efi) to measure a cross-section for 
the production of tt events with two additional ( 7 -jets. The first, referred to as the cut-based analysis, 
applies very tight selection criteria including a requirement of four ( 7 -tagged jets. This analysis results 
in a high signal-to-background ratio and relies on the Monte Carlo (MC) estimates of the background. 


* ATLAS uses a right-handed coordinate system with its origin at the nominal interaction point (IP) in the centre of the 
detector and the z-axis along the beam pipe. The x-axis points from the IP to the centre of the LHC ring, and the y-axis 
points upwards. Cylindrical coordinates (r, (f>) are used in the transverse plane, (j) being the azimuthal angle around the 
z-axis. The pseudorapidity is defined in terms of the polar angle 9 asrj = -lntan(6/2). Angular distance is measured in 
units of AR = -^(Azy)^ - 1 - {Aijif-. 

^ Unless otherwise specified, “leptons” refers exclusively to electrons and muons. The top quark pair production channels 
are labelled according to the decay of the two W bosons. The lepton-plus-jets channel refers to events where one IV boson 
from a top quark decays to hadrons, the other to an electron or muon (either directly or via a t lepton). Dilepton events 
are those in which both IV bosons decay to an electron or muon. 
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including the ft background with additional jets containing c-quarks (c-jets) or only light quarks and 
gluons (light jets). The second applies a looser selection and extracts the signal cross-section from a 
fit to the distribution of a multivariate b-]e.t identification discriminant. This second method, referred 
to as the fit-based analysis, confirms the validity of the background predictions used in the cut-based 
approach, and offers a measurement of the ratio of cross-sections for events with two additional fj-jets 
and all events with two additional jets. 

The fiducial measurements are made considering both electroweak (e.g. from Z boson decays) and 
QCD production of the additional fj-quarks as signal. In order to compare to NLO pQCD theory 
predictions, the measurements are also presented after subtracting the electroweak processes, ttV {V 
corresponding to a IT or Z boson) and ttH. 

The paper is organised as follows. First, the definitions of the fiducial regions are given in Section 2. 
The ATLAS detector is briefly described in Section 3, followed in Section 4 by a description of the 
data and simulated samples used. Section 5 describes the reconstruction of physics objects in the 
detector and presents the event selection used. The sources of systematic uncertainties affecting the 
measurements are described in Section 6. Section 7 describes the analysis techniques used to extract 
the cross-sections and their uncertainties. The final cross-sections are presented in Section 8 and 
compared to recent theoretical predictions. Finally, Section 9 gives brief conclusions. 


2 Measurement definition 

This section details the particle-level fiducial phase-space definitions. Particle-level object definitions 
that are common to all measurements are described in Section 2.1. The particle-level event selection 
is then discussed in Section 2.2, describing first the fiducial selection used to define the cross-section, 
and then, where relevant, the selection used to define the templates that are fit to the data. 


2.1 Particle-level object definitions 

The particle-level definition of objects is based on particles with a proper lifetime Tparticie > 3 x 10“^ ^ s. 
The definitions used here follow very closely previous ATLAS tt fiducial definitions [18]. Fiducial 
requirements are placed only on jets and charged leptons. 

Electrons and muons: Prompt electrons and muons, i.e. those that are not hadron decay products, 
are considered for the fiducial lepton definition. Electrons and muons are dressed by adding to the 
lepton the four-vector momenta of photons within a cone of size AT? = 0.1 around it. Leptons are 
required to have pj > 25 GeV and \r}\ < 2.5. 

Jets: Jets are obtained by clustering all stable particles, except the leptons, dressed with their asso¬ 
ciated photons, and neutrinos that are not hadron decay products, using the anti-k( algorithm [19-21] 
with a radius parameter R = 0.4. Particles from the underlying event are included in this definition, 
whereas particles from additional inelastic proton-proton collisions (pile-up) are not included. The 
products of hadronically decaying t leptons are thus included within jets. Photons that were used in 
the definition of the dressed leptons are excluded from the jet clustering. Particle jets are required to 
have pt > 20 GeV and \r]\ < 2.5. The px threshold for particle-level jets is optimised to reduce the 
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uncertainty of the measurement; it is chosen to be lower than the 25 GeV threshold used for recon¬ 
structed jets (see Section 5), as jets with a true just below the reconstruction threshold may satisfy 
the event selection requirement due to the jet energy resolution. This effect is enhanced by the steeply 
falling pt spectra for the additional jets. A similar choice is not necessary for electrons and muons 
due to their better energy resolution. 

Jet flavour identification: A jet is defined as a fr-jef by ifs association wifh one or more fr-hadrons 
wifh pt > 5 GeV. To perform fhe mafching befween fr-hadrons and jefs, fhe magnifudes of fhe four- 
momenfa of f^-hadrons are firsf scaled fo a negligible value (in order fo nof alfer normal jef reconsfruc- 
fion), and fhen fhe modified fj-hadron four-momenfa are included in fhe lisf of sfable particle four- 
momenfa upon which fhe jef clusfering algorifhm is run, a procedure known as ghost-matching [22]. 
If a jef confains a (r-hadron afler fhis re-cluslering, if is identified as a (r-jef; similarly, if a jef confains 
no fj-hadron buf is ghosf-mafched fo a c-hadron wifh pt > 5 GeV, if is identified as a c-jef. All ofher 
jefs are considered lighf-llavour jefs. 

Overlap between objects: In order to ensure isolation of all objects considered, events are rejected if 
any of the jets satisfying the fiducial requirements lie within A/? = 0.4 of a dressed, prompt lepton. 


2.2 Fiducial event selection 

The fiducial object definitions given above are used to classify events as signal or background. This is 
described in Section 2.2.1. Section 2.2.2 defines the templates used in the fit-based measurements. 


2.2.1 Signal event selection 

The signal definitions are related to the fiducial definition of either a lepton-plus-jets or a dilepton tt 
decay topology with at least one or at least two extra jets. The classification is based on the number 
of leptons and the number and flavour of the jets passing the fiducial object selection. Cross-section 
measurements are reported in the following three fiducial phase-spaces: 

• ttb lepton-plus-jets refers to tt events with exactly one lepton and at least five jets, of which at 
least three are (r-jets; 

• ttb ep refers to tt events with one electron, one muon, and at least three (j-jets; 

• ttbb dilepton refers to tt events with two leptons and least four (j-jets. 

For the ttbb fiducial region, additional requirements are placed on the invariant mass of the lepton pair. 
For all flavours of lepton pairs, the invariant mass of the two leptons (m^i) must be above 15 GeV. In 
events with same-flavour leptons, must also satisfy \mu - mz\ > 10 GeV, where mz is the mass of 
the Z boson. Table 1 summarises the fiducial definition of all three phase-spaces. 
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Liducial 

Requirement 

ttb 

lepton-plus-jets 

ttb 

en 

ttbb 

dilepton 

Meptons (ft > 25 GeV, I 77 I < 2.5) 

1 

2 

2 

Lepton flavours 

e and ji 

e^i only 

ee, nn and efi 

ma >15 GeV 

- 

- 

yes 

- 91 GeV| > 10 GeV 

- 

- 

yes 

fVjets (ft > 20 GeV, \q\ < 2.5) 

> 5 

> 3 

> 4 

X 6 -jets 

> 3 

> 3 

> 4 

ARfj > 0.4 

yes 

yes 

yes 


Table 1: Summary of the three sets of fiducial selection criteria employed for the ttb and ttbb cross-section 
measurements. The jet-lepton isolation {AR(j) requires AR > 0.4 between any of the jets and the leptons. 


2.2.2 Template definitions 

The measurements based on fits determine the signal and background contributions using templates 
of the fj-tagging discriminant for the various categories of events. Because fj-jets, c-jets and light 
jets give different distributions for the discriminant, the non-signal tt events are split according to the 
flavour of the additional jet(s) in the event. 

In particular, the ttb analyses define the signal template (ttb) using the same requirements on the jets 
as used for the cross-section definition, and similar templates are defined for c-jets (ttc) and light jets 
(ttl). With two additional jets, the ttbb fit-based measurement has a larger number of possible flavour 
combinations. The templates of different combinations are merged if they have similar shapes and 
if they are produced through similar processes. This results in four templates: ttbb, ttbX, ttcX and 
ttlX. 

In addition, because the lepton kinematics do not significantly affect the distributions of the f^-jet 
discriminant, the dilepton fit measurements do not include the lepton requirements in the template 
definitions. For these analyses, a correction for the fiducial acceptance of the leptons thus needs to 
be applied (/fid). The ttb lepton-plus-jets analysis uses the same lepton requirements in defining the 
templates as are used for the signal definition. 

Table 2 shows the complete set of criteria used in the fiducial definitions of the various templates. 
For the lepton-plus-jets analysis, contributions from W ^ cq {q - s, d) decays where the c-hadron 
is matched to one of the fiducial jets are included in the ttc template; this contribution is found to 
dominate over that from tt with additional heavy flavour. 

The ttbb cut-based measurement does not make use of templates for fitting. Events are considered 
as signal if they meet the definition of ttbb in Section 2.2.1; all other tt events are considered back¬ 
ground. 


3 ATLAS detector 

The ATLAS detector [23] at the LHC covers nearly the entire solid angle around the collision point. It 
consists of an inner tracking detector surrounded by a thin superconducting solenoid, electromagnetic 
and hadronic calorimeters, and a muon spectrometer incorporating three large superconducting toroid 


6 








Shorthand notation 
for the templates 

Particle-level event requirements 

ttb lepton-plus-jets 

ttb 

?^leptons — L ?^jets ^ 5 and Ufc-jets ^ 3 

ttc 

^leptons — f ? ^jets — ^ and — 2 and > 1 

ttl 

other events 

ttb e^i 

ttb 

^jets — ^ 3.nd. flh—jets — ^ 

ttc 

^jets — ^ jets — ^ 3^nd f^c—^ets — 1 

ttl 

Other events 

ttbb dilepton fit-based 

ttbb 

^Jets > 4 and nt-jets > 4 

ttbX 

^/?-jets “ 3 

ttcX 

jets “ ^ and jets — 1 

ttlX 

Other events 


Table 2: Particle-level definitions used to classify selected tt events into templates for the likelihood fits. The 
categories depend on the number of jets and number of b- and c-jets within the fiducial region. 


magnets. The inner-deteetor system (ID) is immersed in a 2T axial magnetie field and provides 
eharged-partiele traeking in the range \t]\ < 2.5. 

A high-granularity silieon pixel deteetor eovers the vertex region and typieally provides three meas¬ 
urements per traek, the first hit being normally in the innermost layer. This pixel deteetor is important 
for the reeonstruetion of displaeed vertiees used to identify jets eontaining heavy-flavour hadrons. It 
is followed by a silieon mierostrip traeker, whieh has four layers in the barrel region. These silieon 
deteetors are eomplemented by a transition radiation traeker, which enables radially extended track 
reconstruction up to \t]\ = 2.0. The transition radiation tracker also provides electron identification in¬ 
formation based on the fraction of hits (typically 30 in total) above a higher energy-deposit threshold 
corresponding to transition radiation. The ID reconstructs vertices with spatial resolution better than 
0.1 mm in the direction longitudinal to the beam for vertices with more than ten tracks. 

The calorimeter system covers the pseudorapidity range \t]\ < 4.9. Within the region \t]\ < 3.2, elec¬ 
tromagnetic calorimetry is provided by barrel and endcap high-granularity lead/liquid-argon (LAr) 
electromagnetic calorimeters, with an additional thin LAr presampler covering \t]\ < 1.8, to cor¬ 
rect for energy loss in material upstream of the calorimeters. Hadronic calorimetry is provided by 
a steel/scintillating-tile calorimeter, segmented into three barrel structures within \t]\ < 1.7, and two 
copper/LAr hadronic endcap calorimeters. The solid angle coverage is completed with forward cop- 
per/LAr and tungsten/LAr calorimeter modules optimised for electromagnetic and hadronic measure¬ 
ments respectively. 

The muon spectrometer (MS) comprises separate trigger and high-precision tracking chambers meas¬ 
uring the deflection of muons in a magnetic field generated by superconducting air-core toroids. The 
precision chamber system covers the region \t]\ <2.7 with three layers of monitored drift tubes, com¬ 
plemented by cathode strip chambers in the forward region, where the background is highest. The 
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muon trigger system covers the range I 77 I < 2.4 with resistive-plate chambers in the barrel, and thin- 
gap chambers in the endcap regions. 

A three-level trigger system is used to select interesting events [24]. The Level-1 trigger is implemen¬ 
ted in hardware and uses a subset of detector information to reduce the event rate to a design value of 
at most 75 kHz. This is followed by two software-based trigger levels which together reduce the event 
rate to about 400 Hz. 

4 Data samples and MC simulations 

4.1 Data samples 

The results are based on proton-proton collision data collected with the ATLAS experiment at the 
LHC at a centre-of-mass energy of = 8 TeV in 2012. Only events collected under stable beam 
conditions with all relevant detector subsystems operational are used. Events are selected using single¬ 
lepton triggers with pT thresholds of 24 or 60 GeV for electrons and 24 or 36 GeV for muons. The 
triggers with the lower px threshold include isolation requirements on the candidate lepton in order to 
reduce the trigger rate to an acceptable level. The total integrated luminosity available for the analyses 
is 20.3 fb-^ 


4.2 Signal and background modelling 

The sample composition for all analyses is dominated by tt events. Contributions from other pro¬ 
cesses arise from VT-i-jets, Z-i-jets, single top (f-channel, Wt and ^-channel), dibosons {WW, WZ,ZZ) 
and events with one or more non-prompt or fake leptons from decays of hadrons. In these measure¬ 
ments, ffV(where V corresponds to a IT or Z boson) and ttH events that pass the fiducial selection are 
considered as part of the signal. Results with those processes removed are also provided to allow direct 
comparison to theory predictions at NLO in pQCD matched to parton showers (see Section 4.4). All 
backgrounds are modelled using MC simulation except for the non-prompt or fake lepton background, 
which is obtained from data for the ttb lepton-plus-jets and ttb efi analyses, as described below. 

tt : The nominal sample used to model ti events was generated using the PowhegBox (version 1, 
r2330) NLO generator [25-27], with the NLO CTIO parton distribution function (PDF) [28] assuming 
a top quark mass of 172.5 GeV. It was interfaced to Pythia 6.427 [29] with the CTEQ 6 LI [30] PDF 
and the Perugia20IIC [31] settings for the tunable parameters (hereafter referred to as tune). The 
HDAMP parameter of PowhegBox, which controls the px of the first additional emission beyond the 
Born configuration, was set to nitop = 172.5 GeV. The main effect of this is to regulate the high- 
px emission against which the tt system recoils. In Figures I and 2, tables of event yields, and 
comparison to predictions, the tt sample is normalised to the theoretical calculation of 253][}j pb 
performed at next-to-next-to leading order (NNLO) in QCD that includes resummation of next-to- 
next-to-leading logarithmic (NNFF) soft gluon terms with Top-i-r2.0 [32-37]. The quoted uncertainty 
includes the scale uncertainty and the uncertainties from PDF and as choices. 

tiV : The samples of ttV with up to one additional parton were generated with the MadGraph v5 
generator (vL3.33) [38] with the CTEQ6F1 PDF set. Pythia 6.426 with the AUET2B tune [39] was 
used for showering. The top quark production and decay was performed in MadGraph and tt + Zjy* 


8 


interference was included. The tW samples are normalised to the NLO cross-section predictions [40, 
41]. 

ttH: The ttH process was simulated using NLO matrix elements for pp —> tJH provided by the 
HELAC-Oneloop package [42], interfaced to Pythia 8.175 [43] through PowhegBox [44], also known 
as the PowHEL approach [45]. The matrix-element calculation was performed using the CTIO PDF 
set and the parton shower used the AU2CT10 tune [46]. The sample is normalised to the NLO cross- 
section prediction and uses the SM values for the Higgs boson branching ratios [47]. 

W/Z+jets: Samples of IT-i-jets and Z/y*-i-jets were generated using the Alpgen v2.14 [48] leading- 
order (LO) generator and the CTEQ6L1 PDF set [49]. Parton shower and fragmentation were mod¬ 
elled with Pythia 6.426 [29]. To avoid double-counting of partonic configurations generated by 
both the matrix-element calculation and the parton-shower evolution, a parton-jet matching scheme 
(“MEM matching") [50] was employed. The IT/Z-pjets samples were generated with up to five addi¬ 
tional partons, separately for production in association with fi-quarks, c-quarks and light quarks. The 
overlap between events with heavy-flavour quarks obtained from the matrix element and the parton 
showers was removed using a scheme based on angular separation between the heavy quarks. The 
VT/Z-i-jets backgrounds are normalised to the inclusive NNLO theoretical cross-section [51]. In the 
dilepton channel, a data-driven method is used to validate the Z-i-jets normalisation. A region enriched 
in Z-f-jets events is defined by inverting the requirement Iruee/^/i - 91 GeV| > 10 GeV. The data are 
found to agree with the prediction in all lepton channels. 

Dibosons: Samples of ITlT/lTZ/ZZ-rjets were generated using Alpgen v2.14 [48]. Parton shower 
and fragmentation were modelled with Herwig 6.520 [52]. Sherpa 1.4.3 [53-56] samples including 
massive b- and c-quarks with up to three additional partons were used to cover the WZ channel with the 
Z decaying to hadrons, which was not taken into account in the Alpgen samples. All diboson samples 
are normalised to their NLO theoretical cross-sections [57, 58] as calculated with MCFM [59]; the 
NLO PDF set MSTW2008 was used for all decay channels. 

Single top: Background samples of single top quarks corresponding to the t-channel, ^-channel and 
Wt production mechanisms were generated with PowhegBox (version 1, r2330) [25-27] using the 
CTIO PDF set [28]. All samples were interfaced to Pythia 6.426 [29] with the CTEQ6L1 set of 
parton distribution functions and the Perugia2011C tune. In the dilepton channels, only the ITf process 
is considered. Overlaps between the tt and Wt final states were removed according to the inclusive 
Diagram Removal scheme [60]. The single-top-quark samples are normalised to the approximate 
NNLO theoretical cross-sections [61-63] using the MSTW2008 NNLO PDF set. 

All event generators using Herwig 6.520 [52] were also interfaced to Jimmy v4.31 [64] to simulate 
the underlying event. The samples that used Herwig or Pythia for showering and hadronisation 
were interfaced to Photos [65] for modelling of the QED final-state radiation and Tauola [66] for 
modelling the decays of r leptons. The ttH sample was interfaced to Photos-e-i-. All samples were 
simulated taking into account the effects of multiple pp interactions based on the pile-up conditions 
in the 2012 data. The pile-up interactions are modelled by overlaying simulated hits from events with 
exactly one inelastic (signal) collision per bunch crossing with hits from minimum-bias events that 
are produced with Pythia 8.160 using the A2M tune [46] and the MSTW2008 LO PDF [67]. Finally 
the samples were processed through a simulation [68] of the detector geometry and response using 
Geant4 [69]. All simulated samples were processed through the same reconstruction software as the 
data. Simulated events are corrected so that the object identification efficiencies, energy scales and 
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energy resolutions match those determined in data control samples. The alternative tt samples de¬ 
scribed in Section 6.3, used for evaluating systematic uncertainties, were instead processed with the 
ATLFAST-II [68] simulation. This employs a parameterisation of the response of the electromagnetic 
and hadronic calorimeters, and Geant4 for the other detector components. The nominal tt sample 
is processed with both Geant4 and ATLFAST-II; the latter is used when calculating the generator 
uncertainties. 

Table 3 provides a summary of basic settings of the MC samples used in the analysis. The alternative 
tt samples used to evaluate the tt generator uncertainties are described in Section 6.3. 


Sample 

Generator 

PDF 

Shower 

Normalisation 

tt 

PowhegBox (version 1, r2330) 

CTIO 

Pythia 6.427 

NNLO-hNNLL 

W -H jets 

Alpgen v2.14 

CTEQ6L1 

Pythia 6.426 

NNLO 

Z + jets 

Alpgen v2.14 

CTEQ6L1 

Pythia 6.426 

NNLO 

Single top f-channel 

PowhegBox (version 1, r2330) 

CTIO 

Pythia 6.426 

approx. NNLO 

Single top i-channel 

PowhegBox (version 1, r2330) 

CTIO 

Pythia 6.426 

approx. NNLO 

Single top Wt channel 

PowhegBox (version 1, r2330) 

CTIO 

Pythia 6.426 

approx. NNLO 

WZ (excluding Z —> qq) 

Alpgen v2. 14 

CTEQ6L1 

Herwig 6.520 

NLO 

WZ (Z ^ qq) 

Sherpa 1.4.3 

CTIO 

Sherpa 1.4.3 

NLO 

WW,ZZ 

Alpgen v2. 14 

CTEQ6L1 

Herwig 6.520 

NLO 

ttV 

MadGraphv 5 (vl.3.33) 

CTEQ6L1 

Pythia 6.426 

NLO 

ttH 

POWHEL 

CTIO 

Pythia 8.175 

NLO 


Table 3: Summary of the Monte Carlo event generators used in the analyses. Generators used only for evaluating 
systematic uncertainties are not included. 


4.3 Backgrounds with fake or non-prompt leptons 

Events with fewer prompt leptons than required may satisfy the selection criteria if one or more jets 
are mis-identified as isolated leptons, or if the jets include leptonic decays of hadrons which then 
satisfy lepton identification and isolation requirements. Such cases are referred to as fake leptons. 

In the lepton-plus-jets channel, this background is estimated from data using the so-called matrix 
method [70]. A sample enhanced in fake leptons is selected by removing the lepton isolation re¬ 
quirements and, for electrons, loosening the identification criteria (these requirements are detailed in 
Section 5.1). Next, the efficiency for fhese “loose” leptons to safisfy fhe fighf criteria is measured in 
dafa, separately for prompf and for fake lepfons. For prompf lepfons if is faken from a sample of Z bo¬ 
son decays, while for fake leptons if is esfimafed from evenfs wifh low missing fransverse momenfum 
or high lepton impacf paramefer. Wifh fhis informafion fhe number of fake lepfons safisfying fhe fighf 
criferia can be calculated. 

In fhe ttb e/i analysis, fhis background is estimated from dafa using evenfs where fhe fwo lepfons have 
elecfrical charges wifh fhe same sign. Processes which confain fwo prompf lepfons wifh fhe same 
sign, such as ttW, and cases of lepfon charge mis-idenfificafion, are subfracfed from fhe same-sign 
dafa using MC simulafion. In fhe ttbb measuremenfs, fhe background is less imporfanf, as fhe higher 
jef mulfiplicify requiremenf means fewer additional jefs available fo be mis-idenfified as lepfons. In 
fhis case fhe background is esfimafed from fhe simulafion samples described above. 
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4.4 Predictions for tt with additional heavy flavour 

The measured fiducial cross-sections are compared to a set of theory predictions obtained with the 
generators shown in Table 4. In each case the fiducial phase-space cuts are applied using Rivet 2.2.1 
[71]. 

Two generators are used which employ NLO ttbb matrix elements with the top quarks being pro¬ 
duced on-shell. A MadGraph5_aMC@NLO sample was generated in the massive 4-flavour scheme 
(4FS), using two different functional forms for the renormalisation and factorisation scales: fi = 

(pt(^)Pt(^)) ^ (the BDDP [1] form), and fi = ^ + Pjj, where the sum runs 

over all final-state particles. A Powhel sample was generated as described in Ref. [4], with the top 
quark mass set to 173.2 GeV. The renormalisation and factorisation scales were set to with 

the sum in this case running over all final-state particles in the underlying Born configuration. In con¬ 
trast to MadGraph5_aMC@NLO, this sample employed the 5-tlavour scheme (5FS), which unlike 
the 4FS treats ti-quarks as being massless and contains a resummation of logarithmically enhanced 
terms from collinear g ^ bb splittings [72]. In order to regularise the divergence associated with 
gluon splitting into a pair of massless bb quarks, the transverse momentum of each ti-quark, and the 
invariant mass of the bb pair, were all required to be greater than 2 GeV. This implies that the 5FS 
calculation does not cover the entire phase-space measured by the ttb analyses. However, the missing 
events, in which a second ti-quark is produced with pj below 2 GeV, or two ^-quarks have invari¬ 
ant mass below 2 GeV, are expected to contribute only a small amount to the fiducial cross-section. 
The prediction for the ttbb fiducial cross-section is unaffected by the generator cuts. Both the Mad- 
Graph5_aMC@NLO and Powhel samples used Pythia 8.205 [73] with the Monash tune [74] for the 
parton shower. 

The cross-sections are also compared to predictions in which the additional Zi-quarks are not present in 
the matrix-element calculation and are only created in the parton shower. The PowhegBox sample is 
the same one used for the nominal tt prediction, described in Section 4.2. A merged sample containing 
a tt final state with up to three additional partons {b, c, or light) was generated with MadGraph5 
interfaced to Pythia 6.427 with the Perugia2011C [31] tune. Finally, in order to assess the effect 
of the different descriptions of the g ^ bb splitting in the parton shower, a sample consisting of 
LO tt matrix elements was generated with Pythia 8.205 [73] using the ATTBAR tune [75]. The 
inclusive cross-section of the sample was normalised to the NNLO-i-NNLL result [32-37]. Pythia 8 
offers several options for modelling g ^ bb splittings in the final-state parton showers, which may 
be accessed by varying the Timeshower:weightGluonToQuark (wgtq) parameter [76]. Differences 
between the models arise by neglecting (wgtq5) or retaining (wgtq3, wgtq6) the mass-dependent 
terms in the g ^ bb splitting kernels. Differences also arise with respect to the treatment of the 
high-m ^5 region, with specific models giving an enhanced or suppressed g ^ bb rate. The model 
corresponding to wgtq3 was chosen to maximise this rate. Finally, some of the models (wgtq5, wgtq6) 
offer the possibility to choose sgtq-m^|, instead of the transverse momentum as the argument of as in 
the g ^ bb vertices. Here sgtq refers to the TimeShower:scaleGluonToQuark parameter, and is 
allowed to vary in the range 0.25 < sgtq < 1, with larger values giving a smaller g ^ bb rate and vice 
versa. For the model wgtq5, sgtq was set to 1, a combination that minimises the g ^ bb rate, while 
for wgtq6, sgtq was set to 0.25. 
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Sample 

Generator 

Shower 

PDE 

b mass [GeV] 

Tune 

ttbb 

M adGraph5_aMC @ NEO 

Pythia 8.205 

CT10f4 

4.8 

Monash 

ttbb 

POWHEL 

Pythia 8.205 

CTlOnlo 

0 

Monash 

tt -i-<3 partons 

MadGraph5 

Pythia 6.427 

CTIO 

4.8 

Perugia2011C 

tt 

Pythia 8.205 

Pythia 8.205 

CTEQEl 

4.8 

ATTBAR 

tt 

PowhegBox 

Pythia 6.427 

CTIO 

0 

Perugia2011C 


Table 4; Details of the theoretical cross-section calculations. For MadGraph5_aMC@NLO, two different func¬ 
tional forms are used for the renormalisation and factorisation scales. Additionally, the leading-order Pythia 
calculations were done with three different options for the g ^ bb splitting, as described in the text. The 
PowhegBox sample is the one used for the nominal ff prediction in the analyses. 


5 Object and event selection 

5.1 Object reconstruction 

A description of the main reconstruction and identification criteria applied for electrons, muons, jets 
and f7-jets is given below. 

Electrons: Electron candidates [77] are reconstructed from energy clusters in the electromagnetic 
calorimeter that are matched to reconstructed tracks in the inner detector. The electrons are required 
to have Et > 25 GeV and |? 7 ciusterl < 2.47. Candidates in the electromagnetic calorimeter barrel/endcap 
transition region 1.37 < iT/ciusteij < 1-52 are excluded. The longitudinal impact parameter of the track 
with respect to the primary vertex, |zoL is required to be less than 2 mm. Electrons must satisfy 
tight quality requirements based on the shape of the energy deposit and the match to the track to 
distinguish them from hadrons. Additionally, isolation requirements are imposed based on nearby 
tracks or calorimeter energy deposits. These requirements depend on the electron kinematics and are 
derived to give an efficiency that is constant with respect to the electron Ej and rj. The cell-based 
isolation uses the sum of all calorimeter cell energies within a cone of AE = 0.2 around the electron 
direction while the track-based isolation sums all track momenta within a cone of AE = 0.3; in both 
cases the track momentum itself is excluded from the calculation. A set of isolation selection criteria 
with an efficiency of 90% for prompt electrons inZ ^ ee events is used in the ttb analyses. Due to 
the reduced fake lepton background in the ttbb analyses, a looser 98% efficient set of selection criteria 
is used. 

Muons: Muon candidates are reconstructed by matching tracks formed in the muon spectrometer and 
inner detector. The final candidates are refit using the complete track information from both detector 
systems, and are required to have px > 25 GeV, I 77 I < 2.5, and |zol < 2 mm. Muons must be isolated 
from nearby tracks, using a cone-based algorithm with cone size AEiso - 10GeV/p{^. All tracks 
with momenta above 1 GeV, excluding the muon’s track, are considered in the sum. The ratio of the 
summed track transverse momenta to the muon px is required to be smaller than 5%, corresponding 
to a 97% selection efficiency for prompt muons from Z —> pp decays. If a muon and an electron are 
formed from the same track, the event is rejected. 

Jets: Jets are reconstructed with the anti-kj algorithm [19-21] with a radius parameter E = 0.4, using 
calibrated topological clusters [23] built from energy deposits in the calorimeters. Prior to jet find¬ 
ing, a local cluster calibration scheme is applied to correct the topological cluster energies for the 
non-compensating response of the calorimeter, dead material, and out-of-cluster leakage [78]. The 
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corrections are obtained from simulations of charged and neutral particles. After energy calibration, 
jets are required to have px > 25 GeV and I 77 I < 2.5. To avoid selecting jets from secondary interac¬ 
tions, a jet vertex fraction (JVF) cut is applied [79]. The variable is defined as the ratio of two sums of 
the pt of tracks associated with a given jet and that satisfy px > 1 GeV. In the numerator, the sum is 
restricted to tracks compatible with the primary vertex, while in the denominator the sum includes all 
such tracks. A requirement that its value be above 0.5 is applied to jets with px < 50 GeV, Ipl < 2.4, 
and at least one associated track. 

During jet reconstruction, no distinction is made between identified electrons and other energy de¬ 
posits. Therefore, if any of the jets lie within AT? = 0.2 of a selected electron, the single closest jet 
is discarded in order to avoid double-counting electrons as jets. After this, electrons or muons within 
AT? = 0.4 of a remaining jet are removed. 

^-tagged jets: Jets are identified as likely to originate from the fragmentation of a T^-quark (T^-tagged) 
using multivariate techniques that combine information from the impact parameters of associated 
tracks and topological properties of secondary and tertiary decay vertices reconstructed within the 
jet [17]. The multivariate algorithms are trained either using only light-flavour jets as background (the 
“MVl” algorithm), or additionally including charm jets in the background to improve the charm jet 
rejection (the “MVIc” algorithm). The efficiency of identification in simulation is corrected to that 
measured in data, separately for each flavour of jet [17, 80]. For the analyses using a binned fit of the 
T>-tagging discriminant, the probability for a simulated jet to lie in a particular bin is corrected using 
data. 


5.2 Event selection 

To ensure that events originate from proton collisions, events are required to have at least one recon¬ 
structed vertex with at least five associated tracks. 

Events are required to have exactly one or exactly two selected leptons in the lepton-plus-jets and 
dilepton measurements, respectively. At least one of the leptons must be matched to the trigger object 
which triggered the event. For the ttb cp measurement, only events with one electron and one muon 
are considered. To increase the number of events in the ttbb measurements, all three lepton flavour 
combinations {ee, and cp) are considered. Additional lepton requirements are applied in the ttbb 
analyses to remove the backgrounds from Zjy*, T and 7/i/^ decays. The invariant mass of the two 
leptons must satisfy mu >15 GeV and, for events with same-flavour leptons {ee or pp), must also 
satisfy \mu - 91 GeV| >10 GeV. 

The lepton-plus-jets ttb analysis requires at least five jets, at least two of which must be f 7 -tagged. 
For this analysis, c-jet rejection is important so the MVlc T?-tagging algorithm is used, at a working 
point with 80% efficiency for T^-jets from top quark decays. This working point is optimised to give 
the lowest total expected uncertainty on the measurement. The ttb cp and ttbb fit-based dilepton 
analyses require at least three jets, two of which have to be T^-tagged. The same T^-tagging algorithm 
and working point as in the lepton-plus-jets analysis is used to improve the separation between b- 
and c-jets. The ttbb cut-based analysis requires exactly four T^-tagged jets; for this analysis the MVl 
algorithm is used at a working point with 70% efficiency for T^-jets from top decays. For this analysis, 
the tighter working point is chosen to reduce the background as much as possible, while the MV 1 
algorithm is chosen since the impact of the c-jet background on the analysis is less important. Table 5 
summarises the selection criteria applied to the analyses. 
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Requirement ttb 

ttb 

ttbb 

ttbb 

Lepton-plus-jets 

ep 

Cut-based 

Fit-based 

^leptons 1 

2 

2 

2 

Eleetron isolation efficiency 90% 

90% 

98% 

98% 

mc£ > 15 GeV 

- 

yes 

yes 

InW/i/i - 91 GeV| > 10 GeV 

- 

yes 

yes 

l^jets ^ 5 

> 3 

>4 

> 4 


> 2 

4 

> 2 

Zr-tagging algorithm MVlc @ 80% 

MVlc @ 80% 

MVl @ 70% 

MVlc @ 80% 

Table 5: Summary of the main event selection criteria applied in the various channels, 
which are common to all channels, including muon isolation, are described in the text. 

Other requirements 

After these selection criteria are applied, the number of observed and expected events are shown in 
Table 6 for the ttb analyses and Table 7 for the ttbb analyses. For all but the ttbb cut-based analysis, 
the samples are dominated by tt events with an additional light or charm jet. In all cases the data agree 
with the expectation within the systematic uncertainties described in Section 6. The kinematics in all 
channels are also found to be well-modelled. As an example, Figure 1 shows the jet multiplicity, b- 
tagged jet multiplicity, and pj distribution of the jet with the third highest MVlc weight in the lepton- 
plus-jets selection. Figure 2 shows the Zr-tagged jet multiplicity along with the pT distribution of the 
jets with the third and fourth highest MVIc values in the dilepton selection. The jet pT distributions 
in Figures 1 and 2 correspond to the jets that are used in the fit to the distributions of the f?-tagging 

discriminant MVlc (see Section 7.2). 




Component 

Fepton-plus-jets 

ttb ep 


tt 

108600 ±7500 

6620 ±710 


ttb 

5230 ± 330 

286 ±27 


ttV signal 

61 ±61 

3.6 ±3.6 


ttH signal 

140 ±140 

10 ± 10 


ttc 

43300 ± 3000 

629 ± 57 


ttl 

60100 ±6800 

5700 ±630 


VT-rjets 

6700 ± 3500 

- 


Single top 

5490 ±760 

216±58 


Z-i-jets 

1640 ±860 

20 ± 11 


Diboson 

510±140 

8.8 ±3.3 


Fake and non-prompt leptons 

1790 ± 890 

50 ±25 


Total prediction 

124800 ± 8400 

6910 ±720 


Data 

129743 

7198 



Table 6: The number of observed and expected events in the ttb lepton-plus-jets and ep analysis signal regions. 
Indented sub-categories indicate that they are subsets of the preceding category. The uncertainty represents the 
total uncertainty (pre-fit) on the Monte Carlo samples, or on data events in the case of the fake and non-prompt 
leptons. In the ttb efi channel, only the Z ^ tt contribution is included in Z-i-jets; the rest is accounted for in 
the fake lepton component, as is VT-rjets. The breakdown of the tt sample into the fiducial sub-samples is given, 
using the template definitions. For illustration, the contributions to ttb from ffV and ttH are also shown. 
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Component 

Cut-based 

Fit-based 

tt 

23.8 ±7.2 

5750 ±850 

ttbb 

17.1 ±4.8 

110±35 

ttV signal 

0.59 ±0.59 

2.7 ±2.7 

ttH signal 

1.6 ± 1.6 

7.7 ±7.7 

ttbX 

4.1 ±2.7 

280 ±93 

ttcX 

2.4 ±1.0 

730 ±350 

ttlX 

0.30 ±0.39 

4630 ±670 

Single top 

0.41 ±0.51 

150 ±57 

Z-i-jets 

0.82 ±0.96 

240 ± 46 

Diboson 

<0.1 

10.9 ±3.9 

Fake and non-prompt leptons 

<0.1 

18.1±9.1 

Total prediction 

25.1 ±7.2 

6180 ±890 

Data 

37 

6579 


Table 7; The number of observed and expected events in the two ttbb analysis signal regions. Indented sub¬ 
categories indicate that they are subsets of the preceding category. The uncertainty represents the total uncer¬ 
tainty (pre-fit) on the Monte Carlo samples, or on data events in the case of the fake and non-prompt leptons. 
The breakdown of the tt sample into the fiducial sub-samples is given, using the template definitions. For 
illustration, the contributions to ttbb from ffV and ttH are also shown. 


6 Systematic uncertainties 


Several sources of systematic uncertainty are considered that can affect the normalisation of signal and 
background and/or the shape of their corresponding final discriminant distributions, where relevant. 
Individual sources of systematic uncertainty are considered as correlated between physics processes 
and uncorrelated with all other sources. The following sections describe each of the systematic un¬ 
certainties considered in these analyses. The uncertainties quoted are illustrative only and the effect 
of that uncertainty depends on the channel and analysis method used. All analyses use relative norm¬ 
alisation uncertainties. Section 7 details the method by which the uncertainties are included in each 
analysis and discusses their impact on the measurements. 


6.1 Luminosity uncertainty 

Using beam-separation scans performed in November 2012, a luminosity uncertainty of 2.8% for 
= 8 TeV analyses was derived applying the methodology of Ref. [81]. This uncertainty directly 
affects the cross-section calculation, as well as all background processes determined from MC simu¬ 
lation. 

6.2 Physics objects 

In this section, uncertainties relevant to the reconstruction of leptons, jets, and /^-tagging are de¬ 
scribed. 
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Figure 1; Jet multiplicity, ^-tagged jet multiplicity, and transverse momentum pj of the jet with the third highest 
MVlc value in the lepton-plus-jets channel. Events are required to have at least hve jets, at least two h-tagged 
jets and one lepton. The data are shown as black points with their statistical uncertainty. The stacked distri¬ 
butions are the nominal predictions from Monte Carlo simulation; the hashed area shows the total uncertainty 
on the prediction. The bottom sub-plot shows the ratio of the data to the prediction. The non-prompt and fake 
lepton backgrounds are referred to as ‘NP & fakes’. The last bin of the distribution includes the overflow. 


Lepton reconstruction, identification and trigger: The reconstruction and identification efhciency 
of electrons and muons, their isolation, as well as the efficiency of the triggers used to record the 
events, differ slightly between data and simulation. Correction factors are derived using tag-and- 
probe techniques on Z ^ (£ = e,p) data and simulated samples to correct the simulation for 

these discrepancies [82, 83]. These have ~l% uncertainty on all simulated samples. 

Lepton momentum scale and resolution The accuracy of the lepton momentum scale and resolu¬ 
tion in simulation is checked using reconstructed distributions of the Z ^ and Jjip —> 
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Figure 2: Jet multiplicity, /j-tagged jet multiplicity, and transverse momentum pj of the jets with the third and 
fourth highest MVlc values, in the dilepton channel using the ttbb fit-based selection; events are required to 
have at least four jets, two /j-tagged jets and two leptons (ee, ep or pp). The data are shown in black points with 
their statistical uncertainty. The stacked distributions are the nominal predictions from Monte Carlo simulation; 
the hashed area shows the total uncertainty on the prediction. The bottom sub-plot shows the ratio of the data 
to the prediction. The non-prompt and fake lepton backgrounds are referred to as ‘NP & fakes’. The last bin of 
the distribution includes the overflow. 


masses [83, 84]. In the case of electrons, Ejp studies using IT —> ev events are also used. Small 
discrepancies between data and simulation are observed and corrected for. In the case of muons, mo¬ 
mentum scale and resolution corrections are only applied to the simulation, while for electrons these 
corrections are applied to data and simulation. Uncertainties on both the momentum scale and resolu¬ 
tions in the muon spectrometer and the tracking systems are considered, and varied separately. These 
uncertainties have an effect of less than 0.5% on most samples, but up to 1% on a few of the smaller 
backgrounds. 
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Jet reconstruction efficiency: The jet reconstruction efficiency is found to be about 0.2% lower in 
the simulation than in data for jets with pj below 30 GeV, and consistent with data for higher jet 
Pt- To evaluate the systematic uncertainty due to this small inefficiency, 0.2% of the jets with p^ 
below 30 GeV are removed randomly and all jet-related kinematic variables are recomputed. The 
event selection is repeated using the modified selected jet list. These uncertainties have less than a 
0.5% effect on the acceptance of all samples. 

Jet vertex fraction efficiency: The efficiency for each jet to satisfy the jet vertex fraction requirement 
is measured in Z(—> 1-jet events in data and simulation, selecting separately events enriched in 

hard-scatter jets and events enriched in jets from other proton interactions in the same bunch crossing 
(pile-up). The corresponding uncertainty is evaluated in the analysis by changing the nominal JVF 
cut value. This uncertainty has less than a 1% effect on the signal sample, and up to 5% effect on the 
other samples [79, 85]. 

Jet energy scale: The jet energy scale (JES) and its uncertainty have been derived by combining 
information from test-beam data, LHC collision data and simulation [78, 86]. The jet energy scale 
uncertainty is split into 22 uncorrelated sources, each of which can have different jet pj and p depend¬ 
encies. The largest of these components is the uncertainty specifically related fo fj-jefs, which yields 
an uncerfainly of 1.2 - 2.5% on fhe fiducial cross-secfion measuremenfs. 

Jet energy resolution: The jet energy resolution (JER) has been measured separately for data and 
simulation using two in situ techniques [87]. The expected fractional pj resolution for a given jet 
is measured as a function of its pi and pseudorapidity. A systematic uncertainty is defined as the 
difference in quadrature between the JER for data and simulation and is applied as an additional 
smearing to the simulation. This uncertainty is then symmetrised. This uncertainty has a 2-4% effect 
on the acceptance of most samples. 

Flavour tagging uncertainty: The efficiencies for b, c and light jets to satisfy the f^-tagging criteria 
have been evaluated in data, and corresponding correction factors have been derived for jets in sim¬ 
ulation [17, 80]. These scale factors and their uncertainties are applied to each jet depending on its 
flavour and pj. In the case of light-flavour jets, the corrections also depend on jet p. The scale factors 
for T jets are set to those for c jets and an additional extrapolation uncertainty is considered. Eor the 
fit-based analyses, the effect on the shape of the MVlc templates is considered. A covariance matrix 
is formed describing how each source of uncertainty in the scale factor measurement affects each px 
bin. This matrix is diagonalised, leading to a set of statistically independent eigenvectors for each jet. 
The result is 24 uncorrelated uncertainties affecting the t7-jet efficiency, 16 uncorrelated sources each 
for the c-jets and r-jets, and 48 uncorrelated sources affecting the light jets. The effect of these uncer¬ 
tainties depends on the analysis and the sample in question. The fj-tagging uncertainties are typically 
largest for the ttbb channels, having an effect of up to 10%. The uncertainty on the measurement 
from varying the c-jet and light jet mis-tagging rates is usually less than 1%, but may be larger for 
individual backgrounds. The uncertainties associated with t jets are less than 0.5% for all samples. 


6.3 Uncertainties on tt modelling 

A number of systematic uncertainties affecting the modelling of tt production are considered. In 
particular, systematic uncertainties due to the choice of parton shower and hadronisation models, the 
choice of generator, the choice of scale, the parton distribution function (PDE), and the inclusion 
of tiV and ttH events are considered. These systematic uncertainties are treated as fully correlated 
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between the various components of tt (e.g. between ttbX, ttcX and ttlX). The effect of assuming 
these uncertainties to be uncorrelated among the tt components is found to yield slightly smaller 
uncertainties on the measured cross-sections. As many of these uncertainties originate from similar 
physics processes, they are taken to be correlated. 

Parton shower: An uncertainty due to the choice of parton shower and hadronisation model is derived 
by comparing events produced by Powheg interfaced with Pythia 6.427 to Powheg interfaced with 
Herwig 6.520. The PowhegBox parameter hdamp was set to inhnity for this comparison for both 
samples. The difference between the samples is symmetrised to give the total uncertainty. 

Generator: An uncertainty due to the choice of generator is derived by comparing a tt sample gen¬ 
erated with MadGraph interfaced to Pythia 6 to a sample generated by PowhegBox-pPythia 6 . The 
MadGraph sample considered was produced with up to three additional partons. It used the CTIO 
PDF and was showered with Pythia 6.427. The difference between the samples is symmetrised to 
give the total uncertainty. 

Initial- and final-state radiation: An uncertainty on the amount of additional radiation is determined 
using samples generated with MadGraph interfaced to Pythia 6 but where the renormalisation and 
factorisation scales are doubled or halved in the matrix element and parton shower simultaneously, 
which covers the variations allowed by the ATLAS measurement of tt production with a veto on 
additional central jet activity [88]. The uncertainty is taken as half of the difference between the 
samples with higher and lower scales, relative to the central MadGraph prediction. 

Parton distribution function: The PDF and as uncertainties are calculated using the PDF4LHC 
recommendations [89] considering the full envelope of the variations of the MSTW2008 68% CL 
NLO [90, 91], CTIO NLO [28, 92] and NNPDF2.3 5 p FFN [93] PDF sets. Due to limitations in the 
information available in the Powheg event record, this systematic uncertainty is evaluated on a tt MC 
sample generated with MC@NLO [94-96] using Herwig 6.520 for the parton shower, AUET2 for the 
underlying-event tune and CTIO as the nominal PDF. 

Variation of ttV and ttH contributions: The signal in these analyses includes contributions from 
ttV and ttH in addition to QCD ttbb production. The relative proportion of these processes affects the 
fraction of ttbb events within the ttb templates, and the fractions of ttcc within the ttc and ttcX tem¬ 
plates. It additionally affects the calculation of the fiducial efficiency, due to the different kinematics 
of the fi-jets. In order to avoid making assumptions on the processes being measured, the effect of 
doubling or removing ttV and ttH is considered as an uncertainty. 

Table 8 summarises the MC samples used to evaluate the systematic uncertainties on the tt model¬ 
ling. 


6.4 Uncertainties on the non tt backgrounds 

An uncertainty of 6.8% is assumed for the theoretical cross-section of single top production [61, 62]. 
For the Wt channel, the diagram-removal scheme is applied in the default sample, in which all doubly- 
resonant NLO diagrams that overlap with the tt definition are removed [96] . The difference between 
this and an alternative scheme, inclusive diagram subtraction, where the cross-section contribution 
from Feynman diagrams containing two quarks is subtracted, is considered as a systematic uncer¬ 
tainty. 
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Uncertainty 

Generator 

PDF 

Shower 

Nominal 

PowhegBox 

CTIO 

Pythia 6.427 

PDF variations 

MC@NLO 

CTIO, 

Herwig 6.520 



MSTW2008 and 



NNPDF2.3 


Parton shower 

PowhegBox 

CTIO 

Herwig 6.520 

Generator 

MadGraph 

CTIO 

Pythia 6.427 

Additional radiation (x2, xl/2) 

MadGraph 

CTIO 

Pythia 6.427 


Table 8; Summary of the Monte Carlo event generator parameters for the tt samples used to evaluate the mod¬ 
elling uncertainties. For all PowhegBox samples version 1, r2330 is used. For MSTW2008 the 68% CL at NLO 
is used. 


Normalisation uncertainties for IF-i-jets and Z-i-jets backgrounds are set conservatively to 50%. The 
uncertainty on the diboson background rate is taken to be 25%. In the lepton-plus-jets and ttb efi 
analyses, a conservative uncertainty of 50% is used on the number of fake and non-prompt lepton 
events. Because the data samples are dominated by tt events, the effect of all of these uncertainties on 
the final resulf is small. 


7 Analysis methods 


The common components of the cross-section extraction for all analyses are presented in Section 7.1. 
Three of the four measurements presented make use of the distribution of the multivariate discriminant 
used for b-jet identification. These distributions are presented in Section 7.2. The profile likelihood 
fils applied in Ihe measuremenls of Ihe cross-secfion for ttb producfion in Ihe lepfon-plus-jets and 
efj. channels are presenfed in Seclion 7.3. The extraclion of Ihe cross-secfion for ttbb in fhe cuf- 
based approach is presenfed in Secfion 7.4. This is followed in Secfion 7.5 by fhe descriplion of fhe 
measuremenl of fhe same process using a lemplale hi. 


7.1 Cross-section extraction 

The cross-seclions for fiducial ttb and ttbb producfion are oblained from fhe best estimate of the 
number of signal events (Nsig)^ the fiducial efficiency (egd), and, where relevant, the correction for the 
absence of leptons in the fiducial region used in fhe femplafes (/fid)- The mefhod fo defermine N^ig is 
analysis specific and described in delail in each respeclive analysis seclion below. The fiducial effi¬ 
ciency is fhe probability for an event in the fiducial region of fhe femplafes lo meet all reconstruction 
and selection criteria. The correction factor /fid is defined as fhe frachon of selecled events satisfying 
the template definition that also meet the fiducial signal definifion. If is only needed for fhe ttb eju and 
ttbb dileplon hi analyses, which do nol include fhe leplon requiremenfs in fhe lemplate definilions as 
documented in Table 2; fhe ttb leplon-plus-jels analysis uses fhe same fiducial criferia for dehning 
fhe signal and building fhe femplafes, while fhe ttbb cul-based does nol make use of femplafes. The 
cross-section is given by 


fid _ 

^ ~ r ’ 


( 1 ) 


20 







where X is the integrated luminosity. 

The values for efid and /gd are given in Table 9. While the cut-based ttbb analysis has the highest 
signal-to-background ratio, due to the high requirement on the number of Z^-tagged jets (at least four 
instead of at least two), the fiducial acceptance is much smaller than in the other channels. 


Parameters 

ttb 

ttb 

ttbb 

ttbb 


lepfon-plus-jefs 

en 

cuf-based 

fil-based 


0.360+0.002 

0.358 ± 0.006 

0.0681+0.0036 

0.399 + 0.008 

/fid 

1 

0.969 ± 0.003 

- 

0.900 + 0.007 


Table 9: The fiducial efficiency (ega) and leptonic fiducial acceptance (/gg) for all analyses. The uncertainties 
quoted include only the uncertainty due to the limited number of MC events. 


7.2 Multivariate discriminant for Z>-jet identification 

The event selection for the three template fit analyses requires the presence of two or more (^-tagged 
jets. Relatively loose working points are chosen with fj-tagging efficiencies of ~80%, using fhe MVlc 
mulfivariafe algorifhm, because fhis allows for high efficiency and good signal-fo-background separa- 
fion. 

The disfribufion of fhe MVlc discriminanf for jefs wifh fhe fhird highesf, or fhird and fourlh highesf, 
MVlc weighfs is found fo have significanl shape differences befween fhe tt componenfs. The b- 
fagging probabilify disfribufion for fhese jefs has, on average, high values for ttb and ttbb evenfs, 
infermediafe values for evenfs wifh addifional c-jefs, and low values for tt evenfs wifh only addifional 
lighf jefs. 

The MVlc disfribufion is calibrafed fo dafa in five exclusive bins. These bin edges correspond fo fhe 
equivalenf cufs on fhe f?-jef idenfificafion wifh efficiencies of approximafely 80%, 70%, 60%, and 50% 
for f7-jefs from fop quark decays. 

The discriminanf used in fhe ttb analyses consisfs of fhe disfribufion of fhe MVlc of fhe jef wifh fhe 
fhird highesf MVlc weigh!, in fhe five calibrafed bins. The femplafes used for fhe lepfon-plus-jefs and 
ttb efi analyses are shown in fhe lefl and righf plofs of Figure 3, respecfively. 

For fhe dilepfon ttbb fil analysis, fhe MVlc disfribufions for fhe jefs wifh fhird and fourfh highesf 
MVlc weighfs are used. Since fhese are ordered, fhe weigh! of fhe fourfh jef is by consfrucfion 
smaller fhan fhaf of fhe fhird, resulting in 15 possible bins of fhe discriminanf. The disfribufion of fhe 
femplafes used in fhe fif is shown in Figure 4. 


7.3 Profile likelihood fit to extract the ttb cross-sections 

In fhe lepfon-plus-jefs and ttb efi channels, fhe numbers of evenfs in fhe ttb, ttc and ttl cafegories 
are obfained by tiffing fo dafa fhe femplafes of fhe fhird highesf MVlc weigh!. The fif is performed 
combining fhe evenfs from bofh e-i-jefs and //-i-jefs info a single sef of femplafes for fhe lepfon-plus-jefs 
analysis. 
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Figure 3; Distribution of the MVIc discriminant for the jet with the third highest MVIc weight in the lepton- 
plus-jets (left) and ttb efi (right) channels. The ttb signal distribution is compared to the distributions for 
backgrounds with an additional charm jet (ttc) and backgrounds with only additional light jets {ttl). The bin 
edges correspond to the h-tagging efficiency of the MVIc weight. The plots are normalised such that the sum 
over the bins is equal to unity. The statistical uncertainty of these distributions is negligible. 



4 th jet: i.o i.o 0.8 i.o o.e 0.7 t.o o.s 0.7 o.e i.o o.e 0.7 0.6 0.5 
3 rd jet: 1.0 0.8 0.8 0.7 0.7 0.7 0.6 0.6 0.6 0.6 0.5 0.5 0.5 0.5 0.5 

MVIc efficiency bin 


Figure 4; Distributions of the third and fourth highest MVIc weight among jets for ttbb signal, ttbX, ttcX and 
ttlX background. The bins are labelled with the upper edge of the efficiency point of the third highest and fourth 
highest MVIc scores in the event. The order of the bins does not affect the cross-section measurement, for this 
hgure the bins have been ordered by decreasing MVIc efficiency point of the fourth and third MVIc score. 
The plots are normalised such that the sum over the bins is equal to unity. The statistical uncertainty of these 
distributions is negligible. 


A binned likelihood function is constructed as the product of Poisson probability terms over all bins 
considered in the analysis. This likelihood depends on the signal-strength parameters, which are 
independent multiplicative factors of the MC predictions for ttb, ttc and ttl production cross-sections, 
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henceforth referred to as ^ttb, Mttc and The nominal prediction (pi - for each analysis is obtained 
from the PowhegBox tt sample. No constraints are applied to the values of these parameters. Nuisance 
parameters (denoted 6) are used to encode the effect of the various sources of systematic uncertainty 
on the signal and background expectations; these are implemented in the likelihood function with 
multiplicative Gaussian or log-normal priors. The likelihood is then maximised with respect to the 
full set of lu and 9 parameters. The values of these parameters after maximisation are referred to as fi 
and 9. The cross-section from Eq. (1) can be re-written as: 

9) ■ /(id 
£ • efid(0) 

The effects from the systematic uncertainties on both the shape and normalisation of the templates are 
considered, as well as the effect on the fiducial efficiency. In the ttb eju . analysis, the uncertainty on /fid 
is also taken into account. The impact of each systematic uncertainty on these different quantities are 
considered as correlated. 

Due to the large number of nuisance parameters considered, the likelihood fit only includes uncertain¬ 
ties with at least a 0.5% effect on the event yield, or shape uncertainties that cause a relative variation 
of more than 0.5% between two bins. This simplification changes the final result or uncertainty by 
less than 1 % and significantly reduces the execution time. 

The shape variations for the PDF uncertainties on tt in the lepton-plus-jets analysis are found to be 
negligible, therefore only the largest variation in acceptance is considered. In the ttb analysis, 
the PDF uncertainty is evaluated outside of the profile likelihood fit. For each eigenvector of each 
PDF set, new nominal templates are obtained for each of the components and a statistics-only fit to 
the Asimov dataset [97] obtained using the central value of the MC@NFO prediction is done. The 
relative difference between the fitted cross-section and the one obtained from the nominal MC@NFO 
is considered as the PDF uncertainty of that eigenvector. The envelope of all eigenvectors is then 
considered as the PDF uncertainty and added in quadrature to the total uncertainty obtained from the 
full profile likelihood fit. 

Figure 5 shows the MV Ic distribution used to fit the ttb signal strength in the lepton-plus-jets analysis 
(top) and ttb efi analysis (bottom). The left figure shows the predictions from simulation and the 
uncertainty band from the sum in quadrature of the impact of each source of uncertainty. The right 
plot shows the fitted results and the final uncertainty on the total prediction, which is largely driven by 
the size of the available MC samples. Table 10 shows the fitted values of the parameters of interest. 
The Asimov dataset is used to provide expected results. The total uncertainty on the measurement 
is found to be similar to the expected one in both analyses and the fitted ttb signal strength in both 
analyses is higher than one, but still compatible with unity within uncertainties. The impact of the ttc 
and ttl backgrounds on the measurement may be assessed by considering the correlation of jittb with 
fine or fini within the likelihood function. In the ttb efi analysis, the correlation is -0.5 between finb 
and fine, and -1-0.5 between finb and finf, in the lepton-plus-jets analysis, the correlation is -i-O.l in both 
cases. 

The effect of the dominant uncertainties on the fitted signal strength is illustrated in Figure 6. The 
post-fit effect on finb is calculated by fixing the corresponding nuisance parameter at 0 ± erg, where 9 
is the fitted value of the nuisance parameter and erg is its post-fit uncertainty, and performing the fit 
again. The difference between the default and the modified finb, ^fttb, represents the effect on finb 
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of this particular uncertainty. The dominant uncertainties on both of these measurements are from tt 
modelling and Zj-tagging uncertainties affecting the c-jets. In the lepton-plus-jets analysis, due to the 
large fraction of tt events where the VT-boson decays to a c-quark and a light quark, the effect of the 
^-tagging uncertainties on the c-jets is large. Other significant contributions come from the effect of 
^-tagging on Z^-jets and light jets, and the jet energy scale and resolution. The generator comparison 
shows a large effect on both the template shapes and normalisations; it is the dominant uncertainty for 
the ttb e/u analysis, while for the lepton-plus-jets analysis it is smaller due to a cancellation in these 
effects. 

Table 11 shows a summary of the uncertainties grouped into categories. The effect of each uncertainty 
is obtained as above and all sources of uncertainty within a category are added in quadrature to obtain 
the category uncertainty. The total uncertainty in the table is the uncertainty obtained from the full 
fit, and is therefore not identical to the sum in quadrature of each component, due to the correlations 
induced between the uncertainties by the fit. Nonetheless, these correlations are small enough that the 
difference is less fhan 3% in bofh analyses. In order fo obfain separafe esfimafes for fhe sfafisfical and 
sysfemafic componenfs of fhe fofal uncerfainfy in bofh profile likelihood fil analyses, fhe sfafisfical 
componenf of fhe uncertainly is evalualed by fixing all nuisance parameters fo fheir fitted values and 
re-evalualing fhe uncerfainfy on the fit. 


Fit parameter 

Lepton-plus-jets 

ttb 

ep 


Asimov 

data 

Asimov 

data 

^tth 

1 00 

^ - 0.24 

1 ao + 0.35 
- 0.27 

1 00 

- 0.30 

1 30 

- 0.35 

tine 

1 00 

- 0.21 

1-08^16 

i.uu _QJ2 

1 40 

- 0.78 

flttl 

+ •'+'+- 0.17 

1-00 tis 

1 00 +°'^ 

+ ■'+'+ - 0.11 

1 00 +°'"' 

+ ■'+'+ - 0.11 


Table 10: Fitted values for the parameters of interest for the signal strength for ttb, ttc and ttl in the lepton-plus- 
jets and ttb efi analyses. Both the results from the Asimov dataset and the values obtained from the fits to data 
are shown. The uncertainties quoted are from the total statistical and systematic uncertainties. 
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Figure 5: The MVlc distribution of jets with the third highest MVlc weight in the lepton-plus-jets analysis 
(top) and ttb ep analysis (bottom) for all signal and background components. The data are compared to the 
nominal predictions (Pre-fit) (left), and to the output of the fit (Post-fit) (right). The points include the statistical 
uncertainty on the data. The hashed area shows the uncertainty on the total prediction. The non-prompt and 
fake lepton backgrounds are referred to as ‘NP & fakes’. 
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Figure 6: Effect of the uncertainty on the fitted value of yu,,* (Afittb) and pull of the dominant nuisance parameters 
in the lepton-plus-jets (left) and ttb ep analyses (right). The shaded and hashed areas refer to the top axis: the 
shaded bands show the initial impact of that source of uncertainty on the precision of fifth', the hatched areas 
show the impact on the measurement of that source of uncertainty, after the profile likelihood fit at the +lcr 
level. The points and associated error bars show the fitted value of the nuisance parameters and their errors 
and refer to the bottom axis; a mean of zero and a width of 1 would imply no constraint due to the profile 
likelihood fit. Dashed lines are shown at 0 and + 1 for reference. Only the ten highest ranked uncertainties on 
fifth are shown. The index on the fi-tagging uncertainties refers to the fixed, but arbitrary position in the list of 
eigenvectors associated with each jet flavour. 
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Source 

fid 
^ ttb 

Lepton-plus-jets 

uncertainty 

(%) 

fid 
^ ttb 

ttb efi 
uncertainty 
(%) 

fid 

'-'ttbb 

Cut-based 

uncertainty 

(%) 

fid 

^ ttbb 

Fit-based 

uncertainty 

(%) 

Rttbb 

Fit-based 

uncertainty 

(%) 

Total detector 

h-17.5 -14.4 

+ 11.6-8.0 

+ 14.5 

+ 11.9-13.1 

+ 10.9-12.5 

Jet (combined) 

-h3.9 -2.7 

+ 10.1 -6.1 

+5.5 

+6.0 -8.5 

+8.7 -10.7 

Lepton 

+0.7 

+ 1.0 -0.5 

+2.0 

+2.4 -2.7 

+0.8 -1.6 

f>-tagging effect on f>-jets 

+4.4 -4.0 

+3.6-3.1 

+ 12.9 

+9.4 -9.0 

+6.0 -5.8 

f>-tagging effect on c-jets 

+ 16.2-13.4 

+4.0 -3.6 

+ 1.7 

+ 1.4 

+ 1.2-1.3 

f>-tagging effect on light jets 

+3.1 -2.0 

+ 1.9 -2.0 

+4.3 

+3.3 -2.9 

+2.2-1.9 

Total tt modelling 

+ 13.1 -13.7 

+23.8 -16.1 

+23.8 

+21.7 

+ 16.1 

Generator 

+ 1.1 -1.4 

+23.3 -15.1 

+ 16.9 

+ 17.4 

+ 12.4 

Scale choice 

+4.3 

+ 1.1 -2.7 

+ 14.2 

+9.5 

+6.0 

Shower/hadronisation 

+ 11.4-12.1 

+3.0 -3.4 

+8.2 

+8.7 

+7.1 

PDF 

+4.7 -4.5 

+3.3 

+3.3 

+0.8 

+4.1 

Removing/doubling ttV and ttH 

+0.4 

+ 1.1 -0.9 

+ 1.5 

+3.1 -2.7 

+3.0 - 2.6 

Other backgrounds 

+0.8 

+0.9 -0.8 

+ 1.6 

+3.5 -3.3 

+2.5 

MC sample size 

< 1 

< 1 

+9.6 

+7.4 

+7.4 

Luminosity 

+2.8 

+2.8 

+3.2 

+2.9 

+0.1 

Total systematic uncertainty 

+25.5 -19.2 

+30.5 -19.9 

+29.5 

+26.4 -26.9 

+21.1 -21.9 

Statistical uncertainty 

+7.1 

+ 19.2-17.9 

+ 18.4 

+24.6 

+25.2 

Total uncertainty 

+26.5 -20.5 

+36.0 -26.8 

+35.2 

+36.1 -36.4 

+32.9 -33.4 


Table 11: Effect of the various sources of uncertainty on the ttb and ttbb cross-section measurements in the 
lepton-plus-jets and dilepton channels. The uncertainties on the Ratb ratio measurement in the dilepton fit 
analysis are also shown. Asymmetric uncertainties are shown when relevant. For the fit-based measurements, 
the individual and total uncertainties are evaluated from the fit to the data. 
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7.4 ttbb cross-section from cut-based analysis 

This ttbb measurement uses an event counting method in the dilepton channel to extract the cross- 
section. Events with at least four identified f>-jets are considered. 

The estimate of the number of signal events is obtained from the total number of observed events 
passing the final selecfion (A^data) and fhe esfimafe of fhe number of background evenfs. A disfincfion 
is made befween background processes which confain fwo lop and fwo bolfom quarks, bul do nol 
pass fhe fiducial selecfion (referred lo as non-fiducial background), and backgrounds from all ofher 
processes (referred lo as non-ttbb). In order lo avoid making any assumplions aboul fhe cross-seclion 
for ttbb processes, fhe predicfion for fhe non-fiducial background is nol laken direclly from simulalion; 
inslead, simulalion is used lo delermine fhe fraclions of ttbb evenfs fhaf are signal and non-fiducial 
background. In parficular, fhe fraclion of parficle-level ttbb evenfs lhal pass fhe fiducial selecfion, /sig, 
is defined as 


/sig “ 


N. 


sig 


. _|_ ^non-fiducial 

^ ttbb 


The cross-section from Eq. (1) can then be re-written as 


(^data ^non-ttbb) ' fsig 
~ X-efid ■ 

In order fo classify background evenfs as non-fiducial or non-ttbb, an alfempl is made lo malch fhe 
four reconslrucled A-jefs fo parficle-level jels.^ If fwo or more of fhe reconsfrucled A-fagged jels 
malch lighl-flavour or charm parficle-level jels, fhen fhe evenl is classified as non-ttbb, olherwise if is 
considered as ttbb non-fiducial. 

The predicfion for fhe non-ttbb backgrounds is laken from simulalion. The prediction has been val- 
idafed by repealing fhe calculafion wilh differenl definifions of fhe signal region, based on fhe A-jels 
wilh fhe fourlh-highesl value in MV 1. These alfemalive signal regions vary in fhe fraclion of non- 
ttbb backgrounds from less lhan 1% lo more lhan 50%. Nonelheless, fhe measured cross-seclions 
among fhe regions agree wilhin Iheir slafisfical uncerlainlies, giving confidence lhal fhe Monle Carlo 
simulalion provides a sufficienl descriplion of fhese backgrounds. 

For fhe calculations of egd and /sig, bolh elecfroweak {ttZ and ttH) and QCD production are con¬ 
sidered, weighfed according lo Iheir Iheorefical cross-seclions. The values of fhe paramelers Vdata> 
Nnon-ttbb, ffid^ and /gig are shown in Table 12, fogelher wilh Iheir uncerlainlies. 

Each source of syslemalic uncerlainfy is propagaled fo fhe cross-seclion measuremenf in a coherenl 
way by varying simulfaneously fhe effecl on fhe background predicfion, on /sig and on where 
applicable. A symmelrisafion of fhe uncerlainlies is carried oul; for uncerlainlies for which Ihe pos¬ 
itive and negative variations differ (in absolute value) by less lhan 0.5%, Ihe larger of Ihe Iwo is used 
for bolh variations. The middle column of Table 11 shows Ihe effecl of fhe dominanl sources of 
uncerlainfy on Ibis cross-section measurement 


^ The matching is carried out hy considering the closest particle-level jet lying AR < 0.4 from the reconstructed jet. 
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Parameter 

Value 

f^data 
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^non-ttbb 

3.9 

± 1.0 

(sfal.) 

+ 1.5 
-1.7 

(syst.) 

/sig 

0.806 

+ 0.060 (slal.) 

+ 0.061 (sysl.) 

efid (%) 

6.8 

+ 0.4 

(slal.) 

+ 1.5 
-0.9 

(syst.) 


Table 12; The number of observed data events A^data, the predicted non-ttbb background Atnon-ttbb, the signal 
fraction /sig, and the fiducial efficiency in the ttbb cut-based measurement. The numbers include ffV and 
tIH as signal. 


7.5 Maximum-likelihood fit to extract the ttbb cross-section 

The looser event selection used in this analyses allows a template fit to be performed in the 15 pop¬ 
ulated bins of the MVlc distribution for the jets with the third and fourth highest MVlc values. A 
maximum-likelihood fit to the nominal templates of ttbb, ttbX, ttcX, ttlX and non-tt background is 
carried out to extract the number of signal events in each category. Systematic uncertainties are not 
included in the likelihood. The cross-section is then extracted directly from Eq. (1). 

This analysis also allows an extraction not only of the ttbb signal but also of the ttbX, ttcX, ttlX 
contributions and of the ratio of ttbb to the total ttjj yield: 

„ cr ttbb 

^ttbb ~ •> 

^ttjj 

where ttjj refers to tt production with two additional jets. The cross-section for ttjj is obtained by 
correcting the ttbb, ttbX, ttcX and ttlX cross-sections, which are calculated for events with three or 
four particle-level jets, to the fraction with four jets only. For ttbb the fiducial efficiency and fraction 
as documenfed in Table 9 are used; for ttbX, ttcX and ttlX fhe fiducial efficiencies and fracfions are 
shown in Table 13. 


Parameter 

ttbX 

ttcX 

ttlX 

ffid 

0.197 + 0.003 

0.177 + 0.002 

0.0355 + 0.0001 

/fid 

0.898 + 0.005 

0.899 + 0.003 

0.902 + 0.001 


Table 13; The fiducial efficiency (efid) and leptonic fiducial acceptance (/m) for the ttbX, ttcX and ttlX categories 
as used in the ttbb fit-based analysis. The uncertainties quoted include only the uncertainty due to the limited 
number of MC events. 


Figure 7 shows the MVlc distribution used to fit the number of ttbb events; the left figure shows fhe 
predictions from simulafion compared fo fhe observed disfribution in dafa; fhe righf plof shows dafa 
compared fo fhe resulf of fhe fif. The tiffed cross-secfions for each of fhe componenfs are shown in 
Table 14 along wifh fhe predicfions from PowhegBox-i-Pythia6; fhe uncerfainfies shown are fhe sfaf- 
isfical uncerfainly of each componenf as obfained from fhe til. The tiffed cross-sections are compatible 
wifh fhe predictions wifhin til uncerfainfies. The cenlral value for ttbb is 1.1 limes fhe predicfions from 
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PowhegBox+Pythia 6, consistent with the ji values found in the two ttb analyses. In particular the val¬ 
ues for the ttbX ttcX and ttlX may be used to cross-check the assumptions made about the background 
contributions to the cut-based analysis. 
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Figure 7; The MVlc disU'ibution of jets with the third and fourth highest MVlc weight in the dilepton channel 
for all signal and background components. The bins are labelled with the upper edge of the efficiency point 
of the third highest and fourth highest MVlc scores in the event. The data (left) are compared to the nominal 
predictions (Pre-fit), and (right) to the output of the fit (Post-fit). The points include the statistical uncertainty 
on the data. The hashed area shows the total uncertainties. The bottom sub-plot shows the ratio of the data to 
the prediction. The non-prompt and fake lepton backgrounds are referred to as ‘NP & fakes’. 


Process 

Observed 
cross-section [fb] 

Statistical 
uncertainty (%) 

Systematic 
uncertainty (%) 

Total 

uncertainty (%) 

Predicted 
cross-section [fb] 

ttbb 

13.5 

+25 

+27 

+36 

12.3 

ttbX 

61 

+38 

+69 

+79 

63 

ttcX 

270 

+25 

+81 

+85 

180 

ttlX 

5870 

+4 

+ 14 

+ 15 

5800 

Rttbb 

1.30% 

+25 

+22 

+33 

1.27% 


Table 14: Observed and predicted cross-sections for each of the components measured in the ttbb fit analysis 
and on the R„hh ratio. The statistical, total systematic, and total uncertainties on each component are also shown. 
The predicted values are from Powheg-i-Pythia 6 tt. 


For most sources of systematic uncertainty, the templates for signal and background distributions are 
obtained from the event sample where a ±10" shift of the uncertainty was applied. The new templates 
and the old templates are fitted to the nominal MC sample, and the relative difference between the 
yields is taken as the uncertainty on the number of events. For systematic uncertainties that also affect 
the fiducial efficiencies, the efficiency is varied coherently and the effect on the final cross-section is 
obtained. The effect due to limited number of MC events in the templates is obtained from the mean 
of 5000 pseudo-datasets obtained from simulation, where the variance of each bin depends on the 
total MC statistical uncertainty of that bin. The second to last column of Table 11 shows the effect 
on the final ttbb cross-section measurement in this analysis whereas the rightmost column shows the 
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uncertainties on the Rntb measurement. 


The total cross-section uncertainty of each process and on the Rnbb ratio are shown in Table 14 along 
with the statistical and total systematic uncertainties. The uncertainties on the ttbX and ttcX processes 
are large and do not allow the cross-sections of these processes to be constrained significantly. The 
signal strength fittbb has a correlation of 0.4 with finbx, -0.1 with i^ttix, and nearly 0 with ^ttcx- 


8 Results 

The fiducial cross-sections obtained for each analysis in the previous section are shown in Table 15. 

The measurements of the ttb cross-section in the lepton-plus-jets and ttb efi analyses are both higher 
than the predicted cross-section from the Powheg-i-Pythia 6 sample, with a best fit value for the signal 
strength fittb of 1.32 and 1.30, respectively. The total measurement uncertainty in the lepton-plus-jets 
channel is fractionally smaller than in the ttb efi analysis, ~25% compared to ~32%, owing to the 
higher acceptance times branching ratio of this decay channel. The uncertainty in this channel is 
dominated by uncertainties on the tagging efficiency due to c-jets from events in which the W boson 
decays to a c- and a light quark. 

The two measurements of the ttbb cross-section show similar precision despite the different ap¬ 
proaches, with the cut-based and fit-based analyses having a total uncertainty of ~35% and ~36%, 
respectively. The cut-based analysis is largely insensitive to the modelling of the non-ttbb background 
from tt events as the selection criteria are very tight. In contrast, the fit-based analysis uses looser se¬ 
lection criteria in an attempt to obtain a data-driven constraint on these processes. While the precision 
of the fit-based analysis does not allow for a measurement of these backgrounds, it does confirm fhe 
validify of fhe simulation, and allows for an explicif measuremenf of fhe Rnbb ratio. The fwo ttbb 
measuremenfs selecf differenf evenfs and hence are nof fully correlafed. A small excess of dafa wifh 
respecf fo fhe nominal predicfion is seen in fhe evenfs fhaf are common fo bofh measuremenfs, while 
a small deficil is seen for evenfs wifh jefs thaf satisfy fhe MVlc 80% criferion buf fail fhe MVl 70% 
criferion thaf is used in fhe cuf-based analysis. These fwo feafures explain fhe difference befween fhe 
observed cross-section in fhe fwo analyses. 

An alfernafive sef of resulfs is obfained by subfracfing fhe predicted ttV and tlH confribufion from fhe 
signal; no addifional uncerfainfy due fo fhe cross-section of fhese processes is considered. This allows 
a direcf comparison of fhe measuremenfs fo QCD-only predictions, alfhough wifh assumpfions abouf 
fhe ttV and ttH cross-secfions. These resulfs are summarised in Table 16 and Figure 8 and compared 
fo fheorefical predictions obfained wifh fhe generators described in Secfion 4.4 and shown in Table 4. 
The rafio of fhe ttbb and ttjj cross-secfions as measured in fhe ttbb fif-based analysis is compared fo 
fheorefical predicfions in Figure 9. The uncerfainfies on fhe fheorefical predictions are obfained by 
simulfaneously varying fhe renormalisafion and facforisafion scales by a factor of fwo. 

The predicfions confaining NLO mafrix elemenfs for fhe pp —> ttbb process, as well as fhe merged 
LO-i-PS prediction from MadGraph-ePythia 6 are in agreemenf wifh fhe measured cross-secfions 
wifhin fhe measuremenf uncerfainfies. The cross-secfions obfained in fhe 5FS (Powhel) are higher 
fhan fhe 4FS ones (MadGraph5_aMC@NLO) as expected, however fhe fwo predicfions agree wifhin 
fhe respecfive scale uncerfainfies. The models utilizing softer choices for fhe renormalisafion/facforisafion 
scales show fhe besf agreemenf wifh fhe dafa. Differenf g ^ bb splitfing models significanlly affecl 
fhe ttbb and ttb cross-secfions in fhe samples where all addifional f7-jefs come from fhe parfon shower. 


31 


Analysis 

Measured 

Predicted 


Cross-section [fb] 

Cross-section [fb] 

^ lepton-plus-jets 

950 ± 70 (stat.) (syst.) 

720 

O' ttb efi 

50 ± 10 (stat.) ;|;}q (syst.) 

38 

0~ttbb cut-based 

19.3 ± 3.5 (stat.) ± 5.7 (syst.) 

12.3 

O' ttbb fit-based 

13.5 ± 3.3 (stat.) ± 3.6 (syst.) 

12.3 

^ttbb 

1.30 ± 0.33 (stat.) ± 0.28 (syst.) % 

1.27 % 


Table 15: Measured fiducial cross-section for ttb in the lepton-plus-jets and eji channels, and ttbb in the dilepton 
channel using a cut-based or a fit-based method. Results for the R„hh ratio measurement from the ttbb fit-based 
method are also shown. The uncertainties quoted are from the statistical and total systematic uncertainties. The 
predicted cross-section is from PowhegBox with Pythia 6 for the QCD component, from Helac for ttH and 
from MadGraph 5 for ttV. 



ttbb 

ttb Lepton-plus- 

ttb eji 

Rttbh 


[fb] 

jets [fb] 

[fb] 

(%) 

Observed 

(cut-based) 18.2 +3.5 +5.7 

(fit-based) 12.4 +3.3 +3.6 

930 +70 +240 

48 +10 

1.20 +0.33 +0.28 

Madgraph5_aMC@NLO (/ibddp) 


870^220 

49+18 

^^-15 

_ 

Madgraph5_aMC@NLO (jj.Ht/4) 

1 9 0+4.4 
^^•*^-3.6 

520!!™ 

30!™ 

- 

PoWHEL 

9 1+4.5 

430+250 

^-’'4-150 

27!™ 

- 

Madgraph5-i-Pythia 6 

13.3!3| 

790+270 

43!™ 

1 79+0.15 

Pythia 8 (wgtq=3) 

30.1 

1600 

88 

2.50 

Pythia 8 (wgtq=5) 

12.8 

740 

42 

1.10 

Pythia 8 (wgtq=6,sgtq=0.25) 

16.1 

930 

53 

1.37 

PoWHEG-I-PyTHIA 6 (HDAMP=mtop) 

11.2 

690 

37 

1.16 


Table 16: Observed and predicted cross-sections for the three fiducial phase-space regions. The measurements 
are shown with the contributions from ffV and ttH removed to allow direct comparison to the predictions 
containing only the pure QCD matrix elements. Results for the Rt,hh ratio measurement from the ttbb fit-based 
method are also shown. The measurement uncertainties are separated into statistical (first) and systematic 
(second) uncertainties. The uncertainties on the theoretical predictions are obtained by simultaneously varying 
the renormalisation and factorisation scales by a factor of two up or down. These variations have not been 
calculated for the LO Pythia 8 samples or for the Powheg-hPythia 6 sample. 


The predictions corresponding to wgtq=3 and wgtq=5, which correspond to the extreme models, dif¬ 
fer by more than a factor of two. The cross-sections obtained with the wgtq=3 model are significantly 
higher than the measured ones, thus indicating that this model overestimates the g ^ bb rate. The 
cross-sections obtained with the other models are both in agreement with the data. 
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Figure 8: Comparison of the measured cross-sections in the three fiducial phase-space regions with theoretical 
predictions obtained from a variety of different generators. The measurements are shown with the contribu¬ 
tions from ffV and tIH removed to allow direct comparison to the predictions containing only the pure QCD 
matrix elements. The coloured bands indicate the statistical and total uncertainties of the measurements. The 
errors on the theoretical prediction are obtained by simultaneously varying the renormalisation and factorisa¬ 
tion scales by a factor of two. These variations have not been calculated for the LO Pythia 8 samples or for the 
Powheg-i-Pythia 6 sample. 
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Figure 9: Comparison of the measured ratio of the ttbb and tt j j cross-sections in the fiducial phase-space region 
of the ttbb fit-based analysis with theoretical predictions obtained from a variety of different generators. The 
measurements are shown with the contributions from ffV and ttH removed to allow direct comparison to the 
pure QCD generators. The coloured bands indicate the statistical and total uncertainties of the measurement. 
The error on the MadGraph-i-Pythia prediction is obtained by simultaneously varying the renormalisation and 
factorisation scales by a factor of two. These variations have not been calculated for the LO Pythia 8 samples 
or for the Powheg-i-Pythia 6 sample. 
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9 Conclusions 


Measurements in the fidueial phase spaee of the deteetor of the eross-seetions for the produetion of tt 
events with one or two additional Z?-jets are performed in proton-proton eollisions at a eentre-of-mass 
energy of 8 TeV at the LHC. The results are based on a dataset eorresponding to an integrated lumin¬ 
osity of 20.3 fb“\ eolleeted with the ATLAS deteetor. The eross-seetion times branehing ratio for top 
pair events with at least one additional b-]et is measured to be 950 ± 70 (stat.) (syst.) fb in the 
lepton-plus-jets ehannel and 50 ± 10 (stat.) (syst.) fb in the ejj. ehannel. The eross-seetion times 
branehing ratio with at least two additional (j-jets is measured to be 19.3 ± 3.5 (stat.) ± 5.7 (syst.) fb 
in the dilepton ehannel (e/u, fj.fi, andee) using a method based on tight seleetion eriteria, and 13.5 ± 
3.3 (stat.) ± 3.6 (syst.) fb using a looser seleetion whieh allows extraetion of the baekground nor¬ 
malisation from data. A measurement of the ratio of tt produetion with two additional Z^-jets to tt 
produetion with any two additional jets is also performed; this ratio is found to be 1.30 ± 0.33 (stat.) 
± 0.28 (syst.)%. The measurements are found to agree within their uneertainties with NLO-i-PS eal- 
eulations of the pp ttbb proeess, as well as with merged LO-i-PS ealeulations of pp tt+ < 3 jets, 
favouring the predietions obtained with soft renormalisation/faetorisation seales. The measurements 
are shown to be sensitive to the deseription of g ^ bb splitting in the parton shower, with the most 
extreme Pythia 8 model being disfavoured by the measurements. 
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